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Abstract — A centralized wireless system is considered that is 
serving a fixed set of users with time varying channel capacities. 
An opportunistic scheduling rule in this context selects a user 
(or users) to serve based on the current channel state and user 
queues. Unless the user traffic is symmetric and/or the underlying 
capacity region a polymatroid, little is known concerning how 
performance optimal schedulers should tradeoff maximizing cur- 
rent service rate (being opportunistic) versus balancing unequal 
queues (enhancing user-diversity to enable future high service 
rate opportunities). By contrast with currently proposed oppor- 
tunistic schedulers, e.g., MaxWeight and Exp Rule, a radial sum- 
rate monotonic (RSM) scheduler de-emphasizes queue-balancing 
in favor of greedily maximizing the system service rate as 
the queue-lengths are scaled up linearly. In this paper it is 
shown that an RSM opportunistic scheduler, p-Log Rule, is 
not only throughput-optimal, but also maximizes the asymptotic 
exponential decay rate of the sum-queue distribution for a two- 
queue system. The result complements existing optimality results 
for opportunistic scheduling and point to RSM schedulers as 
a good design choice given the need for robustness in wireless 
systems with both heterogeneity and high degree of uncertainty. 

Index Terms — Large deviations, multiuser opportunistic 
scheduling, queues sharing time-varying server, scheduling unre- 
lated parallel machines. 

I. Introduction 

We consider a wireless node shared by two users, where 
each user's data arrives to a queue at the node as a random 
process and each user's channel, in terms of data rate that it 
can support, varies randomly over time. If the channel state 
is available, a policy can schedule users so as to exploit 
favorable channels, e.g., schedule the user which currently 
has the highest rate - this is referred to as opportunistic 
scheduling (see, e.g., QJ, Q). More generally, a scheduler 
can be permitted to allocate rates to the users from a compact 
polytope, where the polytope depends on the channel state. 

An opportunistic or channel-aware scheduler, however, may 
not be stable, i.e., keep the user queues bounded, unless it is 
chosen carefully, e.g., using prior knowledge of the arrival 
and channel process. Except in some degenerate cases, in 
order to ensure stability for all possibly stabilizable arrival 
processes, an opportunistic scheduler must be both channel- 
and queue-aware, and must tradeoff maximizing the current 
service rate (scheduling the queue seeing the highest rate) with 
balancing unequal queues (scheduling the longest queue). 
Note that by balancing queues, one can increase the likelihood 
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of more non-empty queues in the future and, therefore, can 
potentially achieve higher service rates in the future. Moreover, 
the performance of any scheduler, in terms of Quality-of- 
Service (QoS) measures like mean delay or probability of 
queue overflow, depends on how this tradeoff is made. 

Much work to date has focused on engineering queue- 
and-channel-aware schedulers that are throughput-optimal, i.e., 
schedulers that can achieve stability without any knowledge 
of arrival or channel statistics, if stability is at all feasible. 
Examples are MaxWeight [3], Exponential (Exp) rule p), 
and Log rule |5|. In fact, necessary and sufficient conditions 
for Max Weight-type schedulers to be throughput-optimal have 
been shown in (6), J7). 

Stability, however, is a weak form of optimality. In view 
of various QoS goals, it is of interest to study schedulers that 
are delay -optimal, e.g., schedulers that minimize the overall 
average delay (per data unit) seen by users, or ones which 
minimize the probability that either the sum-queue or the 
longest queue overflows a large buffer. These schedulers are 
harder to characterize for channels/servers^Jvith time-varying 
capacity, but some results are available that we briefly discuss 
next. 

In (8j and (5) the Longest-Connected-Queue (LCQ) and 
Longest-Queue-Highest-Possible-Rate (LQHPR) scheduling 
policies are introduced. Strong results are shown for these 
policies; they stochastically minimize the max- and sum- 
queue processes, and thus also the tails of max- and sum- 
queue distributions and mean delay. However, in addition to 
assuming certain symmetry conditions on arrival and channel 
statistics, (8| is limited to on-off channel capacities where 
only a single queue can be scheduled per time slot, and 
[9 1 assumes that the scheduler can allocate service rates 
from the information theoretic polymatroid multiuser capacity 
region associated with the current channel state. In both cases, 
the above-mentioned tradeoff between queue balancing and 
service rate maximization is absent. Indeed in |8 |, all policies 
that pick a connected queue result in the same service rate, 
whereas, in the case of [9], all policies that pick a service 
vector from the maximal points of the current capacity region, 
i.e., points on the max-sum-rate face, result in the same total 
service rate. Thus one can achieve the queue balancing goal, 
without compromising the total service rate. Not surprisingly, 
in both cases the optimal policy turns out to be greedy, in that it 
allocates as much service rate as possible to the longest/longer 
queues. 

'The terms channel and server, and sometimes the terms user and queue, 
are used interchangeably through this section. 
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A related server allocation problem is studied in [10|. The 
paper considers minimizing the mean delay in a two queue 
system where each queue has a dedicated server and a third 
server can be dynamically shared between them. As a result, 
the two queues can be allocated service rates from a polyma- 
troid capacity region, thus the objective of queue balancing 
can again be achieved without compromising the total service 
rate. However, without the underlying symmetry assumptions 
of |8j and (9), and using a dynamic programming approach, 
only the existence of a monotone increasing switching curve 
on the state space of queue process is shown; note that the 
switching curve under LCQ and LQHPR policies lies along 
the line where both queues are equal. For a system with 
a general compact, convex, and coordinate convex capacity 
region and any finite number of queues, [ 1 1 1 gives a large 
deviations principle (LDP) for transient queue process under 
MaxWeight scheduler. This LDP can be used to compute, 
e.g, the asymptotic probability of sum-queue or max-queue 
overflow, as well as the corresponding likely modes of over- 
flow. Although the capacity region is not changing over time, 
the region is such that a scheduler must tradeoff maximizing 
total service rate with balancing unequal queues. Therefore 
this result is insightful in relating the modes overflow to the 
tradeoff made by the MaxWeight scheduler. A more recent 



work [12] gives many sources large deviations for MaxWeight 
scheduler for a similar capacity region. 



Finally, relaxing the symmetry assumptions of (8) and (9), 
the works in fl3)-fl5") consider the asymptotic probability of 
max-queue overflow. The server capacity in [131, though time- 
varying, is identical for all users at any given time, thus the 
need to tradeoff queue-balancing versus service rate maximiza- 
tion is again absent. In fact, the sum-queue process in JT3) is 
identical for all work conserving schedulers. However [|14[ 



and [15] consider a server with asynchronously time-varying 
capacity across users. fl4) studies the asymptotic probability 
of max-queue overflow under MaxWeight scheduler and shows 
that for a given system, as the exponent of queue length in the 
MaxWeight scheduler, a, becomes large, the asymptotic prob- 
ability of max-queue overflow under MaxWeight approaches 
the minimum achievable under any other scheduler. A stronger 
result is shown in [15|, that is, the Exp rule scheduler in 
fact minimizes the steady state asymptotic probability of max- 
queue overflow. Indeed the models in p4[ and fl5) accurately 
capture a wireless channel shared by heterogenous users, 
and exhibit the tradeoff between queue-balancing and service 
rate maximization. Existence of this tradeoff also implies 
that, unlike the LCQ and LQHPR policies, the asymptotic 
optimality of Exp rule does not translate to minimizing the 
asymptotic probability of sum-queue overflow or the mean 
delay. In order to minimize the asymptotic probability of max- 
queue overflow, the desirecQ mode of overflow is one where 
all queues (more precisely, the set of overflowing queues 
which then exclusively share the server,) grow at the same 
rate and overflow at the same time. This constrains the system 
throughput, while, of course, keeping the queue lengths equal 



across users. 

Contributions: In this paper we address the problem of 
minimizing the asymptotic probability of sum-queue overflow. 
First, we give a tight lower bound on this probability under 
any scheduler. By contrast with the desired mode overflow to 
minimize the asymptotic probability of max-queue overflow, 
we will see that in order to minimize the asymptotic proba- 
bility of sum-queue overflow, the desired mode of overflow is 
one where the system throughput is the highest possible and 
queues may build up at different rates. Second, we show that a 
radial sum-rate monotone scheduler (see (5J or Section [III] for 
definition of radial sum-rate monotonicity), called the pseudo- 
Log rule, minimizes the asymptotic probability of sum-queue 
overflow. Although our focus is on overflows of the sum-queue 
instead of overflows of the max-queue as in | p3| , the general 
technique of proof in (15) lends itself well to our problem 
and we rely heavily on the results developed therein. Other 
desirable features of radial sum-rate monotone schedulers have 
been explained in ||5), which include, 

(a) reducing mean delay; 

(b) graceful degradation of service in terms of fraction of 
users that can meet their Quality-of-Service requirements 
during transient overloads; 

(c) robustness to uncertainty in traffic and channel statistics. 
The rest of the paper is organized as follows. The system 

model is described in Section [II] Queue-and-channel aware 
schedulers of interest, namely, MaxWeight, Exp rule, and 
Log rule, and the property of radial sum-rate monotonicity 
are reviewed in Section [nij followed by the introduction of 



pseudo-Log scheduling rule in Section IV The main result 
of the paper is summarized in Section |V| Some preliminary 
discussion and relevant large deviation principles follow in 



Section VI The proofs for the lower and the upper bounds 



stated in the main result of the paper are given in Section |VII| 
and |VIII| respectively. After defining local fluid sample paths 
and developing essential results (summarized in Table [I} in 
Section IX the optimality of the p-Log rule, i.e., the last part 
of main result is proved in Section [X] Immediate extensions 
of the main result to some other interesting system models are 
presented in the concluding Section |Xl| 

II. System model 

We consider the problem of dynamically allocating a time- 
varying server to two queues. Each Queue i £ I = {1,2} is 
fed by an independent arrival process (Ai(t), t = 0, 1, • • • ) 
that is i.i.d. over t and where Ai(t) e Z + denotes the number 
of packets arriving in time slot [t,t + 1). We assume that 
the arrivals are bounded, i.e., A;( ) < C for some finite 
C > 0. Let A(t) = (Ai(t), i € I), and vector A = E[A(0)} 
denote the mean arrivals to the queues. We use bold face, e.g., 
(A(t), t = 0, 1, • • • ), to denote the random process and plain 
font, e.g., (A(t), t = 0, 1, . . .), to denote a realization of the 
process. 

2 By the "desired" mode of max-queue overflow we mean the mode which 
gives the lower bound on the asymptotic probability of max-queue overflow 
under any scheduler, as given in 1 1 5 1 ; the likely mode of max-queue overflow 
under the Exp rule is indeed the desired one. 
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A server with randomly varying service rates is available 
to the two queues and modeled as follows. The server has a 
time-varying state that is modeled by an i.i.d. random process 
(rra(t), t = 0, 1, • • • ), where m{t) G M = {1, 2, • • ■ , M} for 
some finite M > denotes the state of the server over [t, t+1), 
and is drawn from distribution it = (wi, ■ ■ ■ , ttm) > 0. Asso- 
ciated with each server state m £ A4 is a vector /x" 1 G 
When in state m over a time slot, the server can either serve 
at most packets from Queue 1, or at most /i™ packets 
from Queue 2. The scheduling problem is thus to allocate the 
server to one of the queues for each time slot such that a given 
optimality criterion is met. This will be formally described 
later. 

At any integer time t, Q(t) = (Qi(t), i £ i) £ 7L\ 
is a random vector, where Qi(t) denotes the number of 
packets in the i th queue at the end of time slot [t — l,t). 
Let St denote a system sample path up to time t, i.e., 
S t = ((m(r),Q{T),A(T - 1)), r < t\ and S t denote the 
space of all feasible realizations S t - Let if : S t — > / denote the 
queue scheduled to receive service during time slot [t,t + 1), 
i.e., we assume that the system sample path St is available for 
making the scheduling decision for time slot [t,t + 1). The 
evolution of the queue process under the scheduling decision 
it(S t ) is given by, 

Qi(t + 1) = (Qi(i) - AC (t V(s t M}) + + Mt) ■ 

The sequence of functions (z^ (•) , t = 0,1,...) is called 
a scheduler or scheduling policy. It is easy to see that 
under a static state-feedback scheduler, i.e., one where 
i*(S t ) =i*(Q(t),m(t)), the process (Q(t), t = 0,1,...) 
forms a discrete time Markov chain on Z+. 

In the sequel, we will extend the domain of all discrete 
time processes and functions to continuous time: a function 
originally defined on integer times has the same value at any 
real t that it takes at ■ All such processes and functions lie 
in the space of real-valued right continuous functions with left 
limits, denoted by T>. We assume that T> is endowed with the 
topology of uniform convergence over compact sets (u.o.c), 
and the fc-times product space T> k with the product topology. 
Lastly, let (fi, J 7 , P) be a probability space that is large enough 
to define all the random processes in this paper. 

For a given weight vector 6 = (61,62) > 0, we are 
interested in finding a scheduler which, informally speaking, 
minimizes the tail of the distribution of weighted sum-queue 
J2iei biQi(')- We will show that under a static state-feedback 



scheduler, namely the p-Log rule described in Section IV the 



asymptotic probability of weighted sum-queue overflow in the 
steady state, i.e., 



Capacity region 

For each server state to £ AL let V rn £ M.\ denote 
the convex hull of vertices (0,0), (0,/!™), and (/x™,0). 
Then, conditional on the server being in state to, the average 
service jointly offered to the two queues under any scheduling 
rule (such that the average exists) lies in the triangle V m . 
Define the capacity region V v as the set of average service 
vectors offered to the two queues under all possible scheduling 
rules, then V„ is a convex polyhedron given by the weighted 
Minkowski sum of regions V m , i.e., 

V„ = mV 1 @---@ir M V M , 

= I K mv(m) : v(m) £ V m , m£M \ . (1) 

KmEM ) 

See Fig. [T]-a for a graphical illustration of capacity region: the 
server has M = 6 states with some distribution it and vectors 
/i™ in set {(1, 4), (3, 4), (1, 1), (4, 3), (4, 1), (1, 0)}. 

Let {ri, • • • , rM'} for some M' < M be the set of strictly 
positive and finite slopes of the outer normal vectors to the 
facets of capacity region V v . The slopes are indexed such that 
< 7*1 < T2 < ... < Tm 1 < co. Also, let tq — and 
fM'+i = 00. For example, see Fig. [T]-a for a depiction of a 
capacity region with M' = 5 facets with outer normal slopes 
in (0,oo). Finally, let % = {v 1 , ■ ■ ■ ,v M ' +1 } denote the set 
of maximal vertices of the capacity region V„. The vertices 
in set V„ are indexed such that v\ > v\ > . . . > v^ 1 +1 , i.e., 
the vertex v m lies at the intersection of the facets with outer 
normal slopes r TO _x and r m ; e.g., see vertex v 2 in Fig. [TJa. 

We assume that there exists a v £ V n such that A < v, 
which is a sufficient condition for stabilizability of the queues 
[3 1. If the above condition is met, then there exists at least one 
static state feedback scheduler under which the Markov chain 
(Q(t), t = 0, 1, . . .) is ergodic. 

Remark 1: Since we assumed that the server can be al- 
located to at most one queue per time slot, the region V m 
is obtained by taking the convex hull of service vectors 
in the set {(0, 0), (/x™, 0), (0, (J*™)}. However, we can relax 
this assumption and allow the server to be shared between 
the two queues during a time slot. That is, we can asso- 
ciate with each server state to a set of k m service vectors 
{(//fa),//^)),--- ,(/if(M,M?(fcm))}, and allow the 
server to operate at any one of these service vectors. In this 
more general case, each region V m will be an arbitrary convex 
polyhedron obtained by taking the convex hull of all feasible 
service vectors associated with server state m. For example, 
V m can be information theoretic polymatroids as in |9|. The 
optimality results presented in this paper will still hold with 
such a relaxation; see Section [Xl] for some details. 



limsup - log P ^6.^(0) > 



is minimized (see Theorem [T] for a formal statement.) The p- 
Log rule depends only on weight vector 6 and does not require 
any knowledge of arrival or server-state distributions. 



III. Throughput-optimal schedulers and radial 

SUM-RATE MONOTONICITY 

The throughput-optimal schedulers MaxWeight, Exp rule, 
and Log rule mentioned in the introduction are all static 
state-feedback. We formally define them next. Let the vector 
fields h mw {-), h ex P{-) and h loa {-) on R 2 + be given as follows: 
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for all x G R 2 + , 
h mw (x) = (bixf, i el), 
h exp (x) = (hcxpf T 

h lo9 (x) = {h log (1 + a lXl ) , i el), 

for any fixed positive &j's, otj's, a, c, and 
< rj < 1. Then, when the system is in state 
(Q(t),m(t)) = (Q,m) €l? + xM, the MaxWeight 
scheduler serves a queue i* mw given by, 



e argmax h r - 



(2) 



augmented with any fixed tie-breaking rule. The Exp rule i* xp 
and the Log rule i ; * are defined similarly by substituting h exp 
and h 9 respectively in place of h mw . Indeed numerous vector 
field based throughput-optimal schedulers can be engineered 
so as to respond differently to the disparity among the users' 
queue lengths, i.e., make different tradeoffs between service 
rate maximization and queue balancing. For reference see, 
e.g., [7 1, which gives necessary and sufficient conditions for a 
vector field based scheduler to be throughput-optimal. 

We refer to a scheduler as radial sum-rate monotone if, 
as the queues scale up linearly, the scheduling rule allocates 
the server in a manner that de-emphasizes queue-balancing in 
favor of greedily maximizing the current service rate. More 
formally, let v(Q) e V n be the vector of average service 
offered to the queues under a static state-feedback scheduler 
i* , conditional on queue state being Q, i.e., 



v{Q) = (E^rVcQ.m)^}], i e I) , 



where expectation is with respect to (random) m drawn from 
distribution tt. 

Definition 1: A scheduling policy i* is radial sum-rate 
monotone with respect to weight vector b > if for any Q and 
scalar 9 such that 9Q e Z™, the weighted sum of expected 
offered service, (b,v(8Q)), is an increasing function of 6, and 

lim (b, v(9Q)) = max ( (b, y) \y G V* and y t = if Q* = 0) . 

0->oo y 

Let v mw (Q) denote the expected service vector under the 
MaxWeight scheduler, i.e., the vector obtained by substituting 
i* mw for i* in similarly, let v exp {Q) and v l ° 9 {Q) denote 
the expected service vectors under the Exp and Log rule 
respectively. Also, fix a weight vector b > 0. Next, under 
schedulers Maxweight, Exp rule, and Log rule respectively, 
we will identify the sets S™ w , Sq XP , and S l ° 9 given approxi- 
matelj^by, 



S 



(•) 



QeZ 2 , 



,(•) 



(Q) e arg max (y, b) 



in words, the set of queue states such that the expected service 
vector v^'(Q) has the maximum weighted sum with respect 
to weight vector b. For example, for the capacity region shown 
in Fig. [I]-a and weight vector b = (1, 1), Fig. [TJ-6-c? illustrates 

Under the formal definition of set Sq given later, the queue states on the 
boundary of set {Q £ 1?^ : v^''(Q) £ argmax^g^ (y, b)} may or may 



not lie in 5 ( 



the sets Sfi lw ,So Xp , and S l Q ° 9 . The set is a cone, the set 
Sq XP a cylinder with gradually increasing diameter, and the 
set S l ° 9 resembles a French horn. The set S l ° 9 is such that 
for any Q > 0, we have 9Q e S l Q ° 9 for all 9 large enough, 
indicating that the Log rule is radial sum-rate monotone. The 
MaxWeight and Exp rule are not radial sum-rate monotone. In 
fact, Exp rule is the 'opposite' of radial sum-rate monotone in 



that, for any Q such that a\Qi ^ a 2 Qi, we have 



SI 



(•) 



for all 9 large enough. A formal description of the sets <Sq 
is as follows. 

The vector tv-''(Q) can be shown to be an outer normal 
vector to the capacity region V n at point v^''(Q) for each 
scheduler (•) e {mw,exp,log}, i.e., 



v (0 (<3) G argmax , 



(4) 



See, e.g., Lemma 2.1 of |T6]. Recall the set of outer normal 
slopes {ro,7"i,--- , rM',fM'+i}> an d let r k be the largest 
slope strictly less than b 2 /bi (i.e. the slope of vector b) 
and ri the smallest slope strictly greater than b 2 jb\\ thus if 
r k+i 7^ ^2/^1 then / = k + 1, otherwise / = k + 2. For 
example, in Fig. [T]-a, for weight vector b — (1,1), we have 
k = 2 and I = 4. For each scheduler (•) G {raw, exp, log} we 
define the set, 



(•) 



x e 



C 2 .„ . 4\ X ) 



■ r k < 



h ( i\x) 

Then, by mi an\ <y - e>„ 1 

v^(Q) e arg maxj / gy' jt (y, b). Each region Sq is bounded by 



< n 



for any Q G S$ H Zj_, we must have 



(3) switching curves given by, 



x e 



p2 . h { 2 \x) 



rk 



and < x e 



n 



For any queue state Q lying on these switching curves, the 
argmax in Q is not unique and whether v^(Q) lies in the set 
argmax ye v^ (y, b) depends on the tie breaking rule associated 
with pi. For each m G {1, • • • , k, 1 + 1, • • • , M' + 1}, we can 
also define a set, 



5 (0 



X G 



r m -i < 



h^ix) 



< r„ 



Then, again by for any Q e Sm H Z^_, we 

must have v^'\Q) = v m . That is, each set Sm 
for m G {1, • • ■ , k.l + 1, ■ ■ ■ , M' + 1} is associated with a 
unique vertex. The switching curve between any two regions 
S m ] and S m ] +1 for m G {1, • • • , k - 1, 1 + 1, • • • , M'} is given 
by, 



IE G 



*4°(s) 



and as before, for any queue state Q lying on this switching 
curve, the argmax in (HI) is not unique and whether it lies in 
set Sm or S m+1 depends on the tie breaking rule associated 
with (|2]). Fig. [T|-&-c? also show the switching curves and sets 
Sm under MaxWeight, Exp rule, and Log rule. In fact, for 
MaxWeight and Log rule, the switching curves can be given 



5 




Fig. 1, (a) The capacity region for /i m £ {(1, 4), (3, 4), (1, 1), (4, 3), (4, 1), (1, 0)}, depicting Minkowski addition, outer-normal vectors and maximal 
vertices; the resulting switching curves under (b) MaxWeight, (c) Exp rule, (d) Log rule, for weight vector b = (1, 1). 



in closed form, e.g., for Log rule, the switching curves are 
given by, 

a 2 x 2 = (1 + a 1 xi) b 2 r - 1 , 
for r € {n, ■ ■ ■ ,r M '}- 

IV. The pseudo-Log scheduling rule 

In this section we introduce a static state-feedback and radial 
sum-rate monotone scheduler, denoted the pseudo-Log (p-Log) 
rule. We subsequently show in Theorem [T] that the p-Log rule 
minimizes the asymptotic probability of weighted sum-queue 
overflow for any weight vector b. The p-Log rule takes weight 
vector b as a parameter but does not require any knowledge 
of arrival or server-state distributions. 

We will define the p-Log scheduling rule through a vector 
field h = (hi,h 2 ) on R 2 + . For any x S [0, l) 2 , let h{x) = 0. 
For any x E R\ \ [0, l) 2 , 



if x\ > x 2l then let 



if x\ < x 2 , then let 



hi(x) = bi^/xi, 

h 2 {x) = b 2 min(a;2, y^i); 

h\{x) = b\ min(a;i, 
h 2 (x) = b 2 ^fx 2 . 



(5) 



The p-Log rule is given as follows: when the queues are in 
state Q £ 1? + and the server in state m £ M., then the server 
is allocated to queue ipL og (Q, m) given by, 



i* P Log{Qi m ) G argmax h^Q)^ 



(6) 



where, in the case of a tie, if Qi > Q2 the server is allocated 
to Queue 1, otherwise to Queue 2. 

^Sl of the vector h(Q) 



Note that it is only the slope, 



that determines the scheduling decision, and so, for example, 
when Qi > Q 2 , Qi 7^ 0, the slope is given by, 



h 2 {Q) = 62 
hi(Q) 61 



mm 1, 



Q2 



Switching curves under the p-Log rule 

Let v pL ° 9 (Q) denote the vector of average service offered to 
the queues under p-Log rule, conditional on queue state being 
Q, i.e., the vector obtain by substituting i* pLog for i* in d3|. 
Recall the set of outer normal slopes {ro, ri, ■ ■ ■ , Tm'i t M'+\\ 
and, in particular, the slopes and n from this set as defined 



in Section III Similar to the sets Sq for (•) e {mw, exp, log}, 
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Fig. 2. The switching curves under the p-Log rule for the capacity region 
depicted in Fig. [TJa and weight vector b = (1,1). 



we define the set Sq for p-Log rule as, 



S 



2 h fh 
x £ K + : — rks/Xi < x 2 < I — r; Xi 



Similar to the region S l °\ the region So too is shaped 
like a French horn, see Fig. [2] for an illustration. For any 
QeS r\Z'l, we have v pL ° 9 (Q) £ argmaXj,^ (y, b). 
Therefore, we will refer to the region Sq as the weighted-max- 
sum rate region (with respect to weight vector 6). Moreover, 
for any Q > 0, we have 9Q £ So for all 9 large enough, thus 
indicating that p-Log rule is radial sum-rate monotone. Also, 
for each m £ {1, • • • , k, I + 1, • • • , M' + 1}, we define a set 
S m as, 



S,i 



X £ 



h 2 {x) 

-l < t-tt < r„ 
hi(x) 



Then for any Q £ S m nZ 2 + , we have vP L °s(Q) 



v m . All 

switching curves in the half plane {x\ > x 2 } are given by, 

bi 

X2 



-Ty/Xl 



for r £ {for*" > r fc} anc l 21 1 — !■ Similarly, all switching 
curves in the half plane X2 > xi are given by, 



02 _i 

b\ 



for r € {r;, • • ■ , rw'+i} and #2 > 1. We will refer to the 
collection of switching curves and regions S m as a partition of 
(or the queue state space Z^_) under the p-Log rule. It will 
be useful to note that this partition, as well as the partitions 
under MaxWeight, Exp rule, and Log rule, depend only on 
the set of vectors {fj, 1 , ■ ■ ■ , [i M } associated with the M server 
states and not on the distribution n = (tti, • • • , ttm) > over 
these states. 

V. Main result 

The following three-part theorem summarizes the main 
results of this paper. It includes a lower bound on the tail of 



the weighted sum-queue overflow probability, an upper bound 
on the same, and the optimality of the p-Log rule. The first 



part is proved in Section VII the second in Section VIII (and 



Appendix), while the last in Sections IX and[X] 

Theorem 1: For the system model detailed in Section [TTJ 
the following hold. 

(i) Given a weight vector b — (bi : bi > 0,i £ I), there 
exists finite To > such that for any t > To and under any 
scheduling rule starting in any initial state Q(0), we have the 
following lower bound, 



hmmf — logP [ biQi(nt) > n ) 
n->oo n \ — ' / 



where J* is defined in Section [VTl| 

(ii) For a stabilizable system, under p-Log scheduling rule the 
process (Q{t), t = 0, 1, . . .) forms an ergodic Markov chain, 
and we have the following upper bound for a random vector 
Q drawn from the stationary distribution of the Markov chain, 

lim sup — log P ( bjQj > n) < — J** , 

n— foo Tl . 

where J** is defined in Section [VIII| 

(iii) The p-Log rule maximizes the asymptotic exponential 
decay rate of the weighted sum-queue distribution, i.e., 

J** - — " J * ■ 

Remark 2: Since a scheduler can, in principle, be non- 
stationary, the lower bound in ( i) is expressed in a more general 
form than the upper bound in (ii) which is specific to a static 
state-feedback scheduler, namely the p-Log rule. For stationary 
schedulers under which the process (Q(t), t = 0, 1, . . .) forms 
an ergodic Markov chain, the lower bound in (i) also implies 
the same lower bound under the steady state distribution of the 
Markov chain. This is because the lower bound will hold if 
the initial state Q(0) were random and drawn from the steady 
state distribution. 

VI. Fluid-scaled processes and large deviation 

PRINCIPLES 

In this section, we define sequences of fluid-scaled processes 
and functions, and a Large Deviation Principle [17] on those 
sequences, which is used in proving Theorem[T] Define the cu- 
mulative arrivals process F = (F(t) = (Fi(t), i £ I),t> 0) 
obtained from the process (A(t), t > 0) as, 

Lt-ij 

Fi(t) = J2 A ^ k ) > 

time the channel i; 

»(*), m G M), f >0) 



and 
G = 



cumulative 

(G(t) = (G. 



in each 
obtained 



state 
from 



the process (m(t), t > 0) as, 

G m (t) = y j H{ rri (fc) = 



L'-iJ 

E 

k=0 



The triplet (Q, F, G), where Q = (Q(t), t > 0) is the queue 
sample path (under a fixed scheduling rule) corresponding 
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to the sample paths (F, G) and initial state Q(0), denotes a 
realization of the system (Q, F, G). For each n = 0, 1, . . ., 
let (Qt"\ F^ n \G^) denote an independent and identically 
distributed system . We define a corresponding sequence of 
fluid-scaled processes, denoted by {q^ n \ f^ n \g^), as, 



» 



with and g( n > similarly defined. 

The arrival and service processes are i.i.d. and bounded and, 
therefore, satisfy large deviation principles |17[ , In particular, 
for each i E I, define for any scalar A; > the rate function 
Li(-) for the sequence (1) as, 

i l (A l ) = sup(0A l -lo g s[e eA 'W 

where Lj(-) = oo over (— oo,0) and (C, oo). Also, for any 
vector A G R 2 + , let, 



L (f) (X) = J2Li(\ 



iei 

For any probability distribution 7 = (jm, m G A4), we define 
the relative entropy L/ g \(-) of 7 with respect to distribution it 

as, 

L (s)(t) = 7m log; 

where Lt g \{-) = 00 everywhere outside the standard simplex 
in . Now consider any functions (f,g) € V 2+AI . If (/,<?) 
are absolutely continuous, then they are differentiable a.e. and 
we let (f'(t),g'(t)) = <?(t)). For any t > 0, if 

(/(0))<?(0)) = and (/, g) are absolutely continuous on the 
interval [0, t], then let, 

Jt(f,g)= I L (f) (f(s))+L [g) (g'(s)) ds , 
Jo 

otherwise Jt(f,g) — 00. The functional Jt(f,g) is referred to 
as the cost of the trajectories (/, g) over the time interval [0, t]. 
The following is a form of Borovkov/Mogulskii's theorem 

CD. 

Proposition 1: For any fixed T > 0, consider 
a sequence in n of the fluid-scaled processes 

(f {n \g {n) ) = ((/ (n) (*),fl (n) (*))> t e [0,T]), then for 
any measurable B C 2? 2+M , we have that, 

- inf {j T (f,g)\(f,g)£B°} 

U,9) 



< 



liminf-logP((/ ( " ) ,g (n) ) € B) 



71— >OQ ft 

1 

n 



< limsup-logP((/ ( ™ ) ,g (n) ) E B) 



< - inf {J T (f,g)\(f,g)€B} , 

U,9) 

where, B° and B denote the interior and closure of set B 
respectively. 

Let u(n) = \n a ] for some fixed a £ (0,0.5). For 
any function d € T) 2+AI , let [/"d denote the piece-wise 
linear function obtained by linear interpolation over samples 
(<f(*H&il), fc = 0,1,...). The following upper bound is 



Stolyar's refinement of Mogulskii's theorem and was first 
introduced in p5| . 

Proposition 2: For any fixed T > 0, consider 
a sequence in n of the fluid-scaled processes 
(/(»>, ff <»>) = ((/W(*),fl (B) (*)), < e [0,T]). _ Suppose, 
for each n there is a fixed measurable B^"-* C I? 2+M , that is 
a subset of the set of feasible realizations of (f^ n \g^) in 



[0,T]. Then, 

limsup - logP ((/ (n) ,g (n) ) e P (,l) 



(7) 



< 



lim inf inf 



{jr(")t/"(/,3)l(/,.9)GS (n) } 



where 7>) = ^1^1. 

Note that by contrast with Proposition [I] in Propo- 

sition [2] corresponds to a sequence of sets of scaled feasible 
trajectories, and the bound on the right side is an infimum 
over the cost of sampled and linearly interpolated scaled tra- 
jectories. Thus this proposition provides a refinement allowing, 
for example, the consideration of large deviations for sets 
{P»} which can distinguish among the trajectories in the set 
{/C ' G V 2 : converges u.o.c. to a Lipschitz /}. That is 
to say, even if all the fluid scaled trajectories in a set converge 
to the same limiting trajectory, the events flW can be defined 
to include only a subset of these trajectories. 

Also introduced in [15] is a notion of generalized fluid 
sample path (GFSP) which naturally appears when applying 
the bound in Proposition [2j we describe this next. Consider 
a sequence of realizations {q^ n \ f^ n \ g^) n — 1, 2, . . . such 
that along some subsequence (still denoted by {n}), we have 
u.o.c convergence 

(q {n) ,f {n) ,g {n) )->(q,f,g) 

to some Lipschitz continuous functions {q,f,g), and u.o.c 
convergence 



t > 



= (j t («)t/"(/ W ,3 (n) ), t > 0) -J- J = (J t) * > 0) 

to some non-negative increasing Lipschitz continuous function 
J. Then the following construct is called a GFSP, 

fl>= [(q in) J in \g (n) ), J (n \ n = 0,l,..- ; (qj,g); J 

and the function J is referred to as the refined cost function of 
the GFSP. Since this construct contains not only the limiting 
trajectory (q, /, g) but also the sequence converging to it, the 
construct is useful when one needs to zoom in and study 
the limiting trajectories (q, f, g) at a finer-than-fluid scaling. 
Moreover, we will see that for the events BW of interest in 
this paper, the bound in Proposition [2] reduces to an infimum 
over the refined cost of a well-defined set of GFSPs. 



VII. Lower bound on overflow probability under 

ANY SCHEDULING RULE 

For any distribution 7 on the set Ai of server states, let 
V 7 denote the corresponding capacity region, see ([TJi. Let 
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that g(")(0) = and (/ (n) ,5 (n) ) converge uniformly in [0, T ] 
to some (/, g) g B t , must have ^ilii^o) > 1 — 2e for 

all sufficiently large n. Therefore, 



liminf - logP [ yhq^\T Q ) > 1 - 2e ] 
\iei / 



> liminf - log P ( (/ (n) ,5 (n) ) E B, 



> 



J * ; 



v, packets/slot 



Fig. 3. For weights bi = 1, the optimal capacity vector v* (A^ 2 ' , 7r) is 
unique, whereas, vector u*(AW,7r) is any point on the annotated part of 
max- sum-rate face of region V n . 



b = (&i, 62) > be the given weight vector. For any vector 
A € R?_, let u*(A, 7) denote a service vector such that, 



where the rightmost inequality follows from the Mogulskii's 
theorem (Proposition [T|. 

Let A^™) = jr/' n '(To)> an d u' n ' denote the average service 
vector seen by the queues; that is to say, uf To is equal to the 
number of packets served from the i th queue over the interval 



[0, To]. By (Hi in the definition of _B e , for all n sufficiently 
large, 



A* 



(0 , , (") 
< A' ' < A* 4- 
4T '4T ; " " V4T '4T 



w*(A,7) € argmax{(&, v) : v < A, v € V^ 7 } , (8) and by (lOi there exists a v <E V^ 7 * such that 



where, if the argmax is not unique, then v*(X, 7) can be set 
to any maximizer. For example, see Fig. [3] depicting u*(A,7r) 
for weight vector b = (1, 1) and two hypothetical vectors A' 1 - 1 
and A^ 2 ) lying outside capacity region VL The interpretation 
is as follows: if the arrival process were to exhibit an empirical 
mean of A and the server state were to exhibit an empirical 
distribution 7, then serving the queues according to the service 
vector w*(A,7) minimizes the rate of weighted-sum-queue 
build-up, i.e., (6, A — v*(X, 7)). 

Finally, we define the minimum cost (per unit increase of 
the weighted-sum-queue) J* as, 



(0 , , («) 

v - l , I < u (n > < V + , 

4To'4TrJ " " V4T '4T 



(12) 



(13) 



We must also have ijW < \( n \ since the total service cannot 
exceed the total arrivals. This, along with inequalities §V2\ 



ii) and (13-i), implies v < A* + (^fr, 2^)- Without loss of 
generality, we assume that the weight vector b is normalized 
to have J^iei h = 2- Then using ( 13 ii), we get, 



M (n) )<(M + ^r 



7 ,A (6,A-«*(A,7)) 



(9) 



A. Proof of Theorem [7]-( /) 

Let (A*, 7*) be a point that achieves the minimum in 
(|9]l, and let T = (b,X* - v* (A* , 7*))~\ We will show 
that regardless of the scheduling rule, all realizations (/, g) 
sufficiently close to (A*f, 7*f) over the interval [0,T ]- and 
thus having cost Jt (f,g) close to J*- lead to an overflow at 
time T . 

Let ||.|| denote the L M norm. For any e > 0, define a set 
of trajectories over the interval [0, To], 

B £ = I (f,g) £ V 2+M : (f,g) is absolutely continuous; 

Vie [0,T ], 

||i (/) (/'(t))-T (J) (A*)ll< ' 



< max (6, y) : y € F r , y < A* 



To 



e e 

2V2t; 



e 



< (M*(A*, 7 *)> + 



(14) 



\L (g) (g'(t))-L {g) (Y 



< 



2T ' 
2T/ 



s'Wi 



inf 



< — ; and 
5T 



\X*-^f(T Q )\\<^- 
Jo o J o 



(10) 



(11) 



The set B e is measurable, compact, and the cost for any 
(f,g) G B e satisfies \\JT {f,g) — J*\\ < e- Next, we will 
show that any sequence of realizations (q^ n \ f^ n \ g^), such 



Finally, using (|12fi) and ( fT4] >, we have that, 

;6,4 W (To)) = ((fo,A(«))-(6, u (»)))r , 

> (<6,A*>-<6,«*(A*,7*)})T -2e, 
= 1 - 2e . 

In order to obtain an overflow at any time after T while 
still incurring a cost close to J*, the trajectory (A*i, 7*t) can 
be prepended with the zero-cost trajectory for an appropriate 
amount of time. I 

VIII. Upper bound on overflow probability under 

THE P-LOG RULE 

Recall the notion of a GFSP and its refined cost function 
from Section |Vl] Let J** denote the lowest refined cost of a 
GFSP that, under p-Log rule, raises X^gi to 1 from 

the initial state q(0) = 0, i.e., 



J* 



inf 

t>o 



(15) 
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where, 



J*,,t = mf|j t |^: g(0) = 0, 5^6<ft(t)>l| . 

The following is a restatement of Theorem fl]-(ii) in terms of 
a sequence of fluid scaled queues. 

Theorem 2: For each n = 1,2,..., consider the system 
under the p-Log scheduling rule in a stationary regime, then, 
the corresponding sequence of fluid-scaled processes is such 
that, 



lim sup - log P ( V" biQi^ 

n— >co H V . T 



(0) > 1 < - J* 



Remark 3: This is the equivalent of Theorem 8.4 of fl5) , 
and its proof (given in the Appendix) follows the same 
framework and uses classical Wentzel-Freidlin constructions 
p8) . The theorem establishes two things: firstly, that the upper 
bound on the probability of overflow when starting with empty 
queues, given by Stolyar's refinement of Mogulskii's upper 
bound, indeed reduces to inf over the cost of GFSPs of 
interest; and secondly, that a GFSP with the cheapest limiting 
trajectories (/, g) that can raise the sum queue 2~2iei ^ili t0 1' 
starting with empty queues, indeed has a cost arbitrarily close 
to the cost of the cheapest trajectory starting in the stationary 
regime. See Appendix for a proof. 

It is clear that J** < J* (since — J* was lower bound under 
any scheduling rule.) To prove the optimality of p-Log rule, 
we need show J** = J*. In the following section, we develop 
the results needed to show this; these intermediate steps are 
summarized in Table [] 

IX. Local fluid sample path 
Let us first motivate the need for defining LFSP (local fluid 

(n) 

sample path.) For each n, define the set Sm as the "fluid" 
scaled version of set S m of the state space partition associated 
with the p-Log rule, i.e., 

S<>) = [ x £ R 2 + : nx£ S m } . 

Then for the n th system, the scheduling decision at time t 
depends on which set Sm , m £ {0, . . . , k, Z + l, . . . , M' + 1} 
(or the corresponding switching curve) the fluid scaled queue 
q( n '(t) lies in. As n — > oo, the characteristic function of 
set Sq converges pointwise to the characteristic function 



of S, 



(oo) 



{x £ 



1% 



x > 0}; while all other scaled 



sets from the partition collapse to one of the axes. Note 
that this is true for the partition under any radial sum-rate 
monotone scheduling rule. Now consider a Lipschitz contin- 
uous limiting trajectory (q, /, g) for the fluid scaled process 
/("),#(")). One can show that, 

if q(t) £ St\ then ^ (b,q(t)) = (b,f(t)}- max (b,v) , 

but if q(t) hits an axis, we lose information about service rates 
of the queues. Hence, we define a LFSP using a finer-than- 
fluid scaling such that the sets of the partition do not collapse 
and we are able to state the proper derivative of the limiting 
queue trajectory. 



Consider a GFSP over some interval [0, T] and fix any 
t € (0, T) such that q(r) ^ 0. Any sequence t^ 1 ' — > r 
has a subsequence along which q^ n ){j^) — > q(r). Set 

<Tn = V '(t'"') / 'y/n, where, qi n] (-) = max l£ ; <7- n) (-)- To 
obtain a local fluid sample path, we will magnify in both space 
and time the fluid scaled trajectories (q( n \ f^ n ',g^) by a 
factor of fr^ 1 , i.e., an order O(^fn) term. More formally, 
for any fixed S > 0, the following re-scaled functions 
(and their limits mentioned subsequently) over the interval 
r T (n) jT («) -|_ a n S], parameterized by s 6 [0,5], are called 
the local fluid sample paths^J for all i G I and m € A4, 



(n) 
>Vi 


'(«) 


1 

cr n 


»(n) 


'to 


1 

cr n 




■(*) 


= oQi 


An) 
J i 


(*) 


1 


Q (n) 


\s) 


1 


•Urn 







in). 



^a nS )~g^(T^: 



Then along some subsequence in n, 



the functions 



(n) 



>/ 



(n) 



ofj 



W) 



converge uni- 



formly over [0, S] to Lipschitz continuous functions 

{oQ, of, <>g); 

• for each i £ I, the function <>g| either converges 
uniformly over [0, S] to a finite Lipschitz continuous 
function or is identically equal to oo; 

• the function converges uniformly over [0, S] to a 
finite Lipschitz continuous function d, or is identically 
equal to +oo or — oo. 

We will refer to the point (g(r), /(r),g(r)J as the GFSP 
source point of the above defined LFSP. Since q(r) ^ 0, it 
must be that <><?,(•) — 00 f° r at l east one i £ I- Note that 
the local fluid queue, q, is merely a re-centered version of 
<>q, i.e., q(s) = <>q(s) + <>q(0), and is always finite by virtue 
of this re-centering. Moreover, the trajectory q dwells in the 
set {x £ M. 2 \x > — o q(0)}, which is at least a half-plane. 
Lastly, we have the following relation between the cost of 
LFSP over [0, S] and the refined cost sequence of GFSP over 

[ T (n) )T (n) + ^5] (see (9 4) of 



Js{of,o9) - Mof>°9) < lim inf — (J^m. 

n— >oo (J 



7 (") ^ 



(16) 



A. Scheduling over time scales of LFSP 

By d6]l the scheduling decision in the interval 

[ r (") )T W _). cr n 5] depends on the slope, 

fe 2 (Q (nrW+n<7 n 3)) 
/ii (g (nrW +na n s)) ' 

and the sign of d^ (s), recall the tie-breaking rule mentioned 
in the description of p-Log rule in Section IlV] 



4 The definitions of o f} " -1 (•) and <>9m ' (') are the same as in [15) , whereas, 
but centered differently. 



*<?!"'(') m ' e scaled as 



s in [15] 



to 



Description 


Stated in 


Relies on 


Under the usual fluid scaling of the queue state space, some sets of the partition collapse (or merge), 
resulting in loss of information concerning the service rate seen by the queue (e.g., when the queue 
lies in any of the collapsed sets.) Therefore, the fluid-scaled state space is locally magnified enough 
to recover the merged sets of the partition; these magnified or finer-than-fluid-scaled trajectories are 
called Local Fluid Sample Paths (LFSP). 


Section |Txl 
Lemma [Tj 


Technique first 
introduced in (4) 


Although the vector field h(-) associated with the p-Log rule is not a gradient field, it appears as a 
gradient field on the state space of finer-than-fluid-scaled queue. A (globally) Lipschitz continuous 
Lyapunov function is then constructed on this state space. Moreover, under the p-Log scheduling 
rule, the Lyapunov function is shown to have a strictly negative drift for all LFSP trajectories having 
low cost per unit time. 


Lemma |5] 


Lemma [I] 


The strictly negative drift of a Lipschitz continuous Lyapunov function translates into a stronger 
implication: for all LFSP trajectories with low average cost per unit time over a given time interval, 
the decrease in the Lyapunov function of the finer-than-fluid-scaled queue must be proportional to 
the length of the interval. Moreover, a sufficient decrease in the Lyapunov function also implies at 
least a proportional decrease in the weighted-sum-queue. 


Lemma [5] 


Lemma [2] 


The above result is used to show that any fluid-scaled trajectory (GFSP) of interest can be 
magnified to obtain a finer-than-fluid-scaled trajectory (LFSP) such that the cost (per unit increase in 
weighted sum-queue) of the fluid-scaled trajectory and that of the finer-than-fluid-scaled trajectory 
are arbitrarily close. 


Lemma |4] 


Lemma [5] and us- 
ing technique of 
Section 11 of (B) 


Under the p-Log rule, no LFSP exists whose cost per unit increase in weighted-sum-queue is strictly 
less than J*, therefore, the least possible cost under p-Log rule, i.e. J tt , must be equal to J+ — the 
upper bound on the cost under any scheduler. 


Section 10 


Lemma |4] 



TABLE I 

Intermediate steps towards proving Theorem I-(iii). 



Without loss of generality suppose q\(r) > q 2 (r) (and 
recall that we had q(r) ^ 0.) Then by ([5j, for n large enough, 
we have that, 



h 2 (Qi 



r(«) 



T n s)) 



b 2 



nam 1 



Q 2 (btW + na n s) 



(n) 



' V^ n) (r (n) +"ns) 



Then, as n — > oo, the above converges to, 

(i,o«te(«) +•&(())) 



b 2 . 

— mm i 

Ol 



where the convergence is uniform on [0,5]. Let us de- 
fine a vector field h over the state space of q, i.e., 
{x G R 2 \x > - o <z(0)}, as follows: 

vhi(x) = bi , 

oh 2 (x) = b 2 min (1, x 2 + O <72(0)) . 
That is, we can restate the above convergence as, 

h 2 (Q (nT (n) + na n s)) oh 2 (»q{s)) 
h x (Q (tit(") + na n s)) ohi(<>q{s)) 



uniformly on [0, 5]. Hence, the switching curves on the space 
of <><7 are given by, 



| (xi,x 2 ) el 



oh 2 (x) _ ^ 
ohi(x) 

(x 1: x 2 ) G M 2 



x 2 



bi 

>q 2 (0) + T-r ri 
b 2 



for m G {0, • • • , fc}, and are now parallel to x\ axis (i.e., the 
axis of <>9i(s), see Fig. WF] For each m G {1, • • • , k}, define 








X 2 


*2 =-^2^) + T/k 






*■ 

x 2 =-A(P)+'t r 2 i 















Si 


p 

(o- ? 2 (0)) 



Fig. 4. Partitions and switching curves on space of local fluid queue q. 

the set S m by appropriately re-scaling the set S m , i.e., 

yh 2 (x) 



ySr, 



{xi,x 2 ) e 

{(xi,x 2 ) G I 

- ♦&(()) - 



r m -i < 



,hi(x) 



-r m -i < x 2 < 



MO) 



(17) 

and define, 



>S = \ (xi,x 2 ) G 
(xi,x 2 ) G 



>h 2 (x) 
>hi(x) 



- o<72(0) + —r k <x 2 \ . 



(18) 



(19) 



See Fig. |4]for a graphical illustration of sets Sq, . . . ,<>5k- 
The service rate /i(s) allocated to the local fluid queue q(s) at 
time s, depends on which re-scaled set S m (or the associated 
switching curve) the local fluid queue q(s) lies in. More 

s Recall that we have assumed, without loss of generality, that qi(r) > 
Q2{t); had we assumed otherwise, we would have found that the switching 
curves on the space of <>9, i.e., {x £ M. 2 \x > — og(0)} are parallel to the X2 
axis (i.e., the axis of «<72(s))- In the sequel, we continue to assume that at 
the GFSP source point, we have qi(r) > q2 (t), and therefore, a h is given 

by jn}. 
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formally, the following lemma relates the service rate /z(s) 
to the vector field h( q(s)), and can be derived without 
much effort from the results shown for the Exponential rule 
scheduler in [40 

Lemma 1: For any LFSP, the following derivatives exist 
a.e. in [0, S] and are finite: 

"' >/0) , 



d_ 
ds 



X(s) 
70) 

o?0) 



ds 
d 

ds 

AO) - i"0) , 



?0) » 



for some, 



iu(s) e arg max (oh( q(s)),v) 



(20) 



Remark 4: Note that if o<7 2 0) > _ o92(0) + 1, then 
y h( q(s)) — b, and the argmax in (20 1 may not be unique. 



However, /i(s) can still be uniquely identified as long as 
od(s) 0. That is, if d(s) > 0, then n(s) is such that ^ti(s) 
is the largest possible among all points achieving max in pD) ; 
similarly, if d(s) < 0, then /i(s) is such that fJ. 2 (s) is the 
largest. The argmax in pO} may again be non-unique if a q{s) 
lies on a switching curve, i.e., 092O) = — oM0) + j 1 ?"™ for 
some m € {0, 1, • • • , r^}, however, in this case /i(s) can not 
be uniquely identified using only the components of LFSP, nor 
will we need to uniquely identify n(s). 

We will also need the following two crucial lemmas that 
show for the p-Log rule what Lemmas 9.2 and 9.3 of fl5\ 
show for the Exponential rule. The following two lemmas also 
implicitly prove the throughput-optimality of p-Log rule, see 
Appendix for details. 

Lemma 2: There exist fixed constants ei > and Si > 0, 
and a Lipschitz continuous Lyapunov function H (constructed 
in the proof below) such that for any regular point s G [0, S], 
if 



-rJ s {ofiog) < £i 
ds 



then — H( q(s)) < -6 r 



Proof If j^J s (<,f,<,g) is small, then A(s) must be close to 
the mean arrival rate A, and 7(5) close to the server state 
distribution it. Since the capacity region V 1 is continuous 
in the distribution 7 (see Q), there exist vectors A* < v* 
such that uniformly on all sufficiently small values of e\, we 
have A(s) < A*, and v* lies in the interior of Wy( s y Let 
Si = mirij e j(u* — A*). By Lemma [T] we have, 



( oh(<>q(s)) , fi(s) ) 



arg max ( /iLg(s)),u ) , 



> ( oh^qis))^* ) 



(21) 



Note that since hi(x) is a function only of Xi, the vector 
field h is in fact a gradient field associated with a (con- 
tinuously differentiable) function^] H, i.e., WH = Ji on 



{x G 



x > 



>q(0)}. Moreover, since ohi(-) is increasing, 



positive, and bounded, therefore, the function H (•) is convex, 

6 Specifically, Proposition 1 and results leading from (34) to (35) of [4] 
relate the service rate seen by the local fluid queue under Exponential rule 
scheduler to the vector field defining the scheduler. 



increasing in each direction, and Lipschitz continuous. Then 
we have, 



ds 



H(.q(s)) 



( /i( g(s)), o( j'(s) ) , 
oh(<>q(s)), A(s) - fi(s) ) 



< 
< 



( 0h[ q(s)), X* 
-Si , 



where the first inequality follows from pT} and the second 
from the definition of 5±. I 
Lemma 3: There exist fixed constants e 2 > and 
82 > S 2 > such that, if 



then, 



^5(0/, off) - Jn{of,og) < , 



H( q(S)) -H( o q(0)) < -S 2 S 



which further implies that uniformly for all large S, we have 
the following bound on the change in the weighted sum queue, 

(b, o q(S)-oq(0)) < S 2 S. 

Proof Noting that H( q(s)) (as a function of s) is 
Lipschitz continuous with some Lipschitz constant de- 
noted by c, the proof of first statement is identical to 
that of Lemma 9.3 of (T5j: pick a positive e 2 < e\, 
let Bi = {s e [0,S] : ^J s (o/,o5) > ^i} and B 2 =5\B X . 
Then the Lebesgue measures of Bi and B 2 satisfy, 
v(Bi) < ffS and v{B 2 ) > (1 - f t )S. Finally, 



H{oq(S))-H{«q$)) 

d 

i Bl ds 

< c^S- 
£l 



S(S_ 



£2 
ei 

(c + Si)) 



ds 



HLq(s)) , 



Ml 
ei 



-)S , 



Fix any positive S 2 < Si, then a sufficiently small e 2 can be 
chosen to prove the first statement. 

To prove the last statement, we proceed as follows. Without 
loss of generality suppose that the GFSP source point of the 
LFSP being considered satisfies q\{r) > q 2 {r), and therefore, 
0/1 is given by (17i. That is, Vi-ff(-) = o^i(-) = ^1 and 
V 2 H{-) = h 2 ( ) < b 2 . Then for any x > y > o <?(0), we 
have the upper bound 



H(x)-H(y) < (b,x-y) 



(22) 



If U2 > —092(0) + 1 (see Fig. Q, then for all z > y, we have 
oh(z) = b and therefore, 

H(x)-H(y) = (b,x-y) . (23) 

If —092(0) < y 2 < — 092(0) + 1, then we have the lower bound 

H{x) - H(y) 

= / ViH(zi,y 2 ) dzi + I V 2 H(xi,z 2 ) dz 2 , 

7 General conditions for a vector field to form a gradient field can be found 
in, e.g., |19) (pp. 944-945). In our case, H (x) is simply additive separable, 
i.e., H(x) = H 1 (x 1 )+H 2 (x2). 
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> (b,x- y) 



Vl) + 
Vi) + 



b 2 dz 2 



>92(0) + l 

'2 

b 2 dz 2 



(12 



b 2 dZ2 



tn 



(24) 



Combining ( |22]^24[ i, we have the following bounds for any 

x>y> <>q(0), 

(b 1 x-y)-{b 1 + b 2 )<H{x)-H{y)<(b 1 x-y) . 
Using this we get, 



-6 2 S 1 



-S 2 S 
h +b 2 



> 
> 



H(.q(S))-H(.q(0)) , 
(b,.q(S) - oq(0)) - (bl 



Now one can take S large enough to obtain a S 2 . I 
Next, using Lemma [3] above and identical to the result in 
Section 11 of p5| , we show that any GFSP that raises the 
weighted sum queue to unity contains a LFSP with cost close 
to that of the GFSP. Subsequently, we will use this lemma on 
a GFSP of cost close to J** (see ( fl~5] >) in order to obtain a 
LFSP with cost close to J**. 

Lemma 4: Suppose a GFSP ip is given that satisfies 
q(0) = 0, (b,q(T)) = 1 for some T > and has a cost 
Jt < 00. Then, for an arbitrarily small e > 0, an LFSP 
(<>q, ©9, <>d, ©/, off) over an arbitrarily large interval [0, S] 
can be constructed from the elements of ip, such that, 



<6,o3(5)-o9(0)) >9S 



(25) 



for some 9 > (independent of e), and cost (per unit increase 
in weighted sum queue) of this LFSP is bounded above by 
Jt + e, i.e., 

Js(of, ©.9) - Mof, 09) 



Proof Components of the given GFSP ip satisfy, 
(Jt 



(26) 



Jo 



= Jt 



(b,q(T)-q(0)) 

For any < ^ < ( 2 < 1 
time t\ = max(t : (b,q(t)) = £1) and 
<2 = min(t > t\ : (b,q(t))= £ 2 )- Then for any e > 
there must exist < £1 < £2 < 1 such that, 

Jt 2 - Jt t -j £ 

(b,q{h) - q{t x )) T+ 2 



define 
time 



(28) 



or (27 1 cannot hold (Dirichlet's box principle). Fix an S > 
large enough as required by Lemma [3] Then there exists a 
sequence {t^} within [£1,^2] such that, 



(b,qM( T M + a n S)-q^(r^)) 



J (r(")+ CT „S) 



-?(») 



> 



< 



. 

Jr + e 



<6,q(«)(r(") +a n S) -g(«)( r ("))) 

where, as before, a n = y q* l \r^)/ sjn\ such a sequence 
must exist, otherwise ( 28 1 cannot hold (another application of 
Dirichlet's box principle, and consequence of the convergence 



of J^ and q( n ^). The above two inequalities can be re-written 

as, 



(&,og (w >OS)-*? (n) (0)) > 0, 

< Jt + e 



-1 f~f( n ) 

( J (r(™)+cr, l S) 



(b,.qW(S)-<>qM(p)) 



(29) 
(30) 



We also have that q(-) 7^ over interval [ti,t 2 ]. Now we 
can pick a subsequence in n along which r' n ^ — > r G [£i,i 2 ] 
and ( o9 ("), <>g( n ), d (?l) , ff (,l) ) converge to 

(©9; o<7j ©rfj 0/7 ©<?)> as described in Section llXl thus ob- 
taining a LFSP (with (9(7"), /(t), <7(t)) being its GFSP source 
point.) From ([29 1 we also have that (b, q(S) — ©<?(0)) > 0. 
Then by Lemma 3] we get, 

Js(of, off) - ^o(o/) 0.9) > £25* , 

which, alongside ( [30| and ( |16) , further implies that for some 
6 > 0, 



(b,og(5)-og(0)> > #S 



and finally, 



W, off) - Jo(o/, off) 

(b,«q{S)) -(6,o9(0)> 



X. Proof of Theorem[T}(iii): optimality of the 

P-LOG RULE 

Recall that to prove optimality of p-Log rule (Theorem [TJ- 
(iii)), we need show J** > J*. We will do this by showing that 
assuming J** < J* leads to a contradiction with the definition 
of J*. 

Suppose J, 



< J*, then by definition of J** in 



15 1, there 
Tor some 



exists a GFSP ip satisfying q(0) = 0, (6, q{T)) = 
finite T > 0, and having a cost Jt < J*- Then by Lemma |4] 

(27) 

from the components of GFSP ip, we can construct an LFSP 



(o9, ©9\ o^i ©/) off) satisfying ( 25 1 and ( 26 1 for an arbitrarily 
large S and an e small enough so that J*** = Jt + e < J*. 

Since ©<&(•) = 00 for at least one i E I, without loss of 
generality suppose ©<7i(-) = 00, thus all switching curves 
on the space of <>9 are parallel to qi axis. Recall that the 
lower boundary of the set o S isgiven by the switching curve 
x 2 = —©172(0) + j^rk (see Fig. 41. Let Si and S2 respectively 
be the first and the last time in~[0, S] such that the trajectory 
©92 (s) < -©92(0) + ^r fe , with Si = S 2 = S if q 2 (s) 
never hits \— <! q 2 (0), —©172(0) + ^i"k\- Note that the trajectory 
of q(s) lies in So in (0, Si) and (S2,S). Then one of 
the following must be true over the interval [0, Si] (similarly 
[S 2 ,S]): 

(a) (6,o9(0)) < (b, q(Si)) and the cost per unit increase in 
weighted-sum-queue over the interval [0, Si] is less than 
J**** i.e. 



JsAof, ©ff) - Jojof, off) 

<&,©9(Si)-©<z(0)) 



13 



(b) (b, o q(0)) < (b, q(Si)) and the cost per unit increase in 
sum-queue over the interval [0, Si] is strictly greater than 

J * $ # 5 l.C 



JsAof, off) - -Mo/, off) 



> J* 



(6,o«(Si)-*«(0)> 

(c) (WO)) > (b,^(Si)). 

If (a) is true for either one of the intervals (suppose its true 
for [0, Si],) we proceed as follows: define vectors 7, A, and fi 
as the average server state distribution, arrival rate, and service 
rate respectively over [0, Si], i.e., 



(A, 7, A) = irj (/'(*)> S'OO'M*)) da. 

By Lemma[T|and the fact that (<>g(s) : < s < Si) lies in So, 
we have /z(s) € axgrnax^gy , . (b,v). This and the linearity 
of /z(s) in 7(3) (see (|TJ) implies /} g axgraaXugv- (6, u). 
Then, 



1 



Si 



(31) 



(6,o«(Si)-g(0)) = (6, A -A) Si 

< ( b, X -v* (A, 7)) Sj 



where w* (A, 7) is as defined in (|8|. Finally, 

JsAof^g) - Jo(of,og) 



> 



> 



(&,♦«(&)-♦«(<))) 
(L (/) (A) + L (g) ( 7 ))Si 

(6, «g(Si) -♦?(<))) 
%)W + L (g)(7 ) 
b,\-v*{\l) 



where the first inequality follows from the assumption that 
(a) is true, the second from convexity of rate functions, and 



the last one from (31 1. However, by the definition of J* in 
d9j, the right side of last inequality cannot be less than J*, 
giving the contradiction we needed; therefore, we must have 

J * — J * * ■ 

Now, if (a) is not true for both intervals [0, Si] and [S2, S], 
then we proceed as follows. Recall that our LFSP satisfies ( |25j ) 
and d26ll, i.e., 



and, 



<& >o g(S)-„g(0)) > OS > 



^s(o/, ©5) ^ Jo(of, o9) 



(32) 



(6, o g(S)- o g(0)> ' 






JsAof, 0.9) - ^0(0/, <>g) 


^ (6, g(Si) 


-o9(0)) 


(&,og(Si)- o g(0)) 


(&,og(S) 


-og(o)) 


Js 2 (of, 0.9) - JsAof, o.9) 


(b,oq(S 2 ) ~ 


-og(Si)) 


(&, g(S 2 ) -<>g(Si)) 


<&,<>g(S)- 


-og(o)) 


Js(o/, o5) ^ Js 2 (of, oS) 


„ (&,og(S)- 


«g(S 2 )) 


(6, g(S)- 9(S 2 )) 


(fo,o9(S)- 


«3(0)> 




Fig. 5. Illustration of original trajectory (»g(s), s S [Si, £2]) and the 
extension (©g(s), s £ [52, S 2 ]) in order to obtain o92(Sj) = 092(51). 



Therefore, if (a) is not true for both intervals [0, Si] and [S 2 , S] 
(equivalently, (b) and/or (c) are true over these intervals), then 
for d33l to hold, we must have, 



J*. 



> 



Js 2 (of,o9) - -Js, (of, off) 



(34) 



(35) 



(6,©g(S 2 ) -<>g(Si)) 
and, for some fixed 9\ > 0, 

(6,©g(S 2 )- g(Si)) > ffi<6,og(5)- o g(0)) 

The above, along with ( [32] ) gives, 

(6,og(S 2 )- g(Si)) > MS. 

Moreover, since (&, <>?(•)) is Lipschitz, for some fixed (9 2 > 0, 

s 2 -Si > 2 (6,o9(s 2 )-o9(Si)) > 92e.es. 

This, along with ( |35| ) and Lemma [3] imply, 

Js a (t>f,*g) - Jsi(ef,o9) > e 2 (S 2 -Si) , 

> e 2 e 2 9i6S . (36) 



Subsequently, we will use ( |33p and ( |36] l to make the quantities 
on the left side of these inequalities as large as needed by 
choosing a large S. 

Consider the trajectory (©g(s), s£ [Si,S 2 ]) which is an 
element of the LFSP ( g, ©g, d, ©/, ©#) over [Si,S 2 ]. 
For some Sj > S 2 , we can append to this LFSP an 
extension over the time interval [S 2 , S 2 ] so as to obtain 
og 2 (S 2 ) = o9 2 (Si) (see Fig. pV Moreover, since the terminal 
values o9 2 (Si) and ©q 2 (S 2 ), lie within the bounded interval 
[~o9 2 (0), — o9 2 (0) + j^fk], therefore the extension LFSP can 
be constructed such that it has a bounded cost, i.e., 

Js'M>»9)-JsM»9)<&, (37) 
and a bounded increase in weighted-sum-queue, i.e., 

\{b,oq(S' 2 ) -©g(S 2 ))| < A , (38) 

for some large fixed A < 00 which is independent of any 
components of the LFSP over [0, S], (e.g. the value of S or 
the terminal values <>9 2 (Si) and <>q 2 (S 2 ).) Finally, the constant 



(33) 



S can be chosen large enough such that, by (35 1 and (38 1, we 
have, 

(&,©g(S 2 )-©g(Si)) > , 
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and by (|34]>-([38), for some small €4 > that satisfies 

J*** + £4 < J*, we have, 



Finally, 



Js£(o/>og) ~ Js! (of, o9) 
(b,MS'2))-(b,oq(Si)) 



< J* 



e 4 < J, 



(39) 



Set Co = —o<?2(0) and for each m G {1, • • • , k}, choose 
a c m G f-o&Cs) + ^r m -i,-o92(s) + s|^m) such that 
the counting measure of set {s G [0, S' 2 ] : <>q(s) = c m } 
is finite; such {c m } exist since q is Lipschitz. 
Lastly, choose a c^+i < 00 large enough such that 
max se [ SljS /] q 2 (s) < c k+1 . For each m G {1, • • • , fc + 1}, 
define a set C m = {s£ [Sx,S' 2 ] : o^O) G [c m _i,c m ]}. We 
make the following three observations which will be used in 
the sequel: for each rn E {1, • • • , k + 1}, 

(i) the trajectory (<>q2(s), s G [Si,^]) intersects with end 
points of interval [c m _i,c m ] only finitely many times, 
therefore, the corresponding set C m can be written as a 
union of finitely many intervals; 

(ii) if set C m is non-empty, then 



q 2 ( min s) 

s£C m 



>?2(max s); 



(iii) the trajectory ( q(s), s G C m ) can intersect with at most 
one switching curve x 2 = —<>q2(s) + j^r m -i, or equiv- 
alently, with at most two adjacent regions <><S(.), (e.g., 
for m = k + 1 the above trajectory can intersect with 
adjacent regions Sk and Sq). 

Then using (i) and (ii) above, for all sets C m , we must have, 



/ <>q 2 (s) ds = . 
Jc m 

Moreover, there exists a set C m such that, 

/ (b, q'(s)) ds>0 
Jc m 

and, 

I Cm (%)(*/'(«)) + W«g'( s ))) <>« 

In ( & >o</0)) ds 



(40) 



< J» 



otherwise ( |39] l will not hold. With a set C m for which the 
above two hold, let, 



7, Aj = 



1 



>f'(s), g / (s),fi{s)) ds 



By (iii) above, (18-20l, and the fact that d(s) > in 



[Si, S'2], the service vector fi is a convex combination of 
at most two adjacent vertices of capacity region V* and 
lies on the facet with outer normal slope r m -i- That is, 
fi is necessarily a maximal element of V^. This, together 
with the fact that A2 = [i 2 which follows from (40 1, gives 
fx = arg min ve v- (b,(X — v) + \ = v* (A, 7), and then, 



(b, q'(s)) ds = (b,\-V*(\,*f)) v(C m ) 



J* > 



> 



Sc m { L (f)(«f'( s ))+ L ( a )(«9'(s))) ds 

Ic m ( b >°<l'( s )) ds 
L (f) (\) + L (g) (j ) 

6,A-u*(A, 7 ) 



By the definition of J*, the right side of last inequality cannot 
be less than J„, giving the required contradiction; hence we 
conclude that J* = J**. I 

XI. Conclusion and extensions 

In order to minimize the asymptotic probability of weighted- 
sum-queue overflow, the desirable mode of overflow is one 
where queues may build up at different rates, however, the 
total weighted service rate seen by the queues is the highest 
possible, (service rate subject to being not more than the arrival 
rate.) The p-Log scheduling rule minimizes the asymptotic 
probability of weighted-sum-queue overflow and exhibits such 
a mode of overflow. This property of p-Log rule is related to 
the collapse under fluid scaling of all but one set of the state 
space partition to either of the axes; the set that does not 
collapse is the horn-shaped weighted-max-sum-rate set. This 
collapse under fluid scaling is typical of the partition under 
any radial sum-rate monotone scheduler. However, the scaling 
or magnifying factor required to obtain a useful LFSP, and 
the shape of the sets of partition on the local fluid state space 
will vary for different radial sum-rate monotone schedulers. 
In this regard, p-Log rule yields an easily tractable partition 
of the local fluid space where all switching curves are parallel 
to one of the axes. 

Recently in |26|, the authors have reported a promising 
framework to relate the gradient field associated with a 
MaxWeight-type scheduler to the modes of overflow and large 
deviations of the appropriately scaled queue process. They are 
able to show that the Log rule indeed minimizes the asymptotic 
probability of sum-queue overflow; the framework, however, 
does not cover the p-Log or Exp rule since the vector field 
associated with either of these schedulers is not a gradient 
field. In this regard, Lemmas [2] and [3] of this paper suggest that 
it may be possible in some cases to locally replace the vector 
field with a gradient field and thus obtain a suitable Lyapunov 
function, and relate the negative drift of this Lyapunov function 
to that of the quantity of interest (weighted sum-queue in our 
case). 

The following extensions of the main results of the paper 
as well as the system model are possible without much effort. 

First, the lower bound (i.e., Theorem [Tj-i and its proof) goes 
through without any changes for any fixed number of queues 
(instead of only two) sharing the time-varying server. 

Second, the main result (Theorem [TJ is also applicable to 
the following different and simpler system model. Instead of 
a single server with time-varying state m(t) G {1, 2, . . . , M} 
having distribution n, suppose there are M distinct servers 
with fixed but asymmetric capacities across the two queues; 
such servers are typically called parallel "unrelated" machines 
(see, e.g., (271). More specifically, the m th server, if allocated 
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to the i € / queue, can serve "Kill™ <E Z + packets from 
the queue. The total service offered to a queue is taken to 
be the sum of service offered by each server assigned to that 
queue. Then the scheduling problem is to dynamically assign 
the servers to the queues based on the queue state. When queue 
is in state Q, the p-Log scheduler in this context allocates the 
m th server to the queue ipL og (Q , ni) as given by (|6j. This 
system model is simpler in that the only random process now 
driving the system are the arrivals, but is also different from 
the original model in that there are multiple parallel servers 
with each server having asymmetric capacities across the two 
queues. 

Third, the main result also goes through if the capacity 
regions V m are permitted to be arbitrary convex polyhedra 
instead of just triangles obtained as a convex hull of service 
vectors (0, 0), (/i™, 0), and (0, (J,™). That is, in a more general 
model, in any server state to, the server can be permitted to 
operate at any one of the k m service vectors from the set, 

{«(l),/xni)),"-- ,{^(k m ),^(km))} ■ 

The region V m then will be the convex hull of the k m 
vertices associated with state to. The only change needed is 
to generalize the definition of p-Log rule as follows. When 
the system is in state (Q, m), operate the server at a service 
vector fJ.* pLog (Q, m) <E V m given by, 

V* P Lo g {Qi m ) G arg max (y,h(Q)) , 

where, in the case of a tie, if Q\ > Q2, then fj, * P Log (Q,m) 
maps to the maximizer with the largest capacity for Queue 1, 
otherwise Mp^ og (<3, to) maps to the maximizer with the largest 
capacity for Queue 2. This generalization affects the fluid and 
local fluid sample paths through ( |20| in Lemma [T] which can 
be shown to hold exactly as in [20], | 2~T) . 

Besides the above extensions, we will conclude by stat- 
ing one more interesting application of the p-Log scheduler. 
A throughput-optimal scheduler can also be used to offer 
minimum and maximum average service rate guarantees to 
infinitely backlogged queues, referred to as tasks, sharing a 
time-varying server/wireless channel |22|, J23) . This is done 
by using virtual token queues that are fed by deterministic 
arrivals at a constant rate Xi, and making scheduling decisions 
to serve tasks based on the virtual token queues (augmented 
with a scheduling rule to use when all token queues are 
empty). If rates \ are feasible (i.e., vector A lies in the interior 
of capacity region associated with the time-varying server), 
then under any throughput-optimal scheduler, each task i will 
be offered an average service rate Vi > A, (such that v G V^). 
However, if rates Xi are not feasible, then main result of this 
paper implies that the average service rate vector v has the 
following interesting and desirable property under p-Log rule: 
(b,v) is maximized subject to %\ < A,;. That is, p-Log rule 
splits the tasks in two sets, for one set of tasks Vi = Aj, 
whereas for the other V{ < Xi, and the sets are chosen such 
that the total weighted service rate (b, v) is maximized. 

Appendix 

We begin by developing the necessary results need for the 
proof of Theorem[2] The corresponding proof in fl?) has some 



parts- lemmas and theorems- that are not specific to the Exp 
rule; these mostly go through by interpreting (i) (which is 
the notation for max-queue in fl5) ) as weighted-sum-queue 
J2iei biqi{t), while other lemmas and theorems require more 
work specific to the p-Log rule. We proceed by stating the 
following two theorems whose proofs are short and identical to 
those of Theorem 8.5 and Theorem 8.6 in [15] by interpreting 
<l*{t) as £) i6/ 

Theorem 3: (See Theorem 8.5 of (T5j) For any fixed T > 
and < c < 1, let us denote 



J**,<t,c = inf jj t |-0 : y^Mi(O) < c and 

iei 

^2 & *&(*) - 1 for SOme 1 - T }- 

iei 

Then, we have, 
limsup — log sup p( sup S~] biq^ n \t) > l) 



< - J* 



*,<T,c 



and as c —> 0, 



<T,o = inf . 



(41) 



Theorem 4: (See Theorem 8.6 of |15|) For any fixed 
C* > S > 0, and T > 0, let us denote 

K{C*,5, T) = inf I J T \ij> : ^ h qi (0) < C* and 



^fei<fc(t) > 5 for all t e [0,T]| 



Then, we have, 

limsup — log sup p( inf ft) > S 

™ n E !e ^ l9l M (o)<c- Kt ^t J 1 

< -K(C*,S,T) . 

Theorem 5: (See Theorem 8.7 of (BJ) For any C* > 0, 
there exists Ai > such that for all sufficiently large T and 
all S € (0,C*), we have K(C*,S,T) > A t T. 
Proof It is clear that K(C* , 5, T) is increasing in 5. We will 
show that there exists an £5 > and 65 > such that for any 
GFSP satisfying (6, q(-)) > over [0, T], 

if Jt-Jo< t 5 T, then (b, q(T) - g(0)> < -8 5 T , (42) 

hence for all T > C*/S 5 and all S € (0, C*), we must have 
K(C*,5, T) > e 5 T, thus the desired result. 

Fix S large enough as required by Lemma [3] and 
recall £2 and 62 therein. For each n, define sequence 
{ T l n \ l = 0,1, ...,/*} in interval [0,T] such that t { 



(n) 



0, 



and, 



.(«) 



» 



where cr n (-) = y qi n \-) /y/n, and /* is the largest integer (that 

(n) (71) 

depends on n) such that r ; » ; < T, Let B2 be the union of 
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intervals [t^"\tS] over which the refined cost is strictly less 



than e20n(77 )5, i.e., 



B { 2 n) = { U 



'/ 



-j(ni -jyn) . [„),„•, 



f(n) 



and = [0,T] \ B { 2 n) . Now pick a positive e 5 < e 2 /2. 

From this point on, we assume n large enough. The Lebesgue 



measures of and B 2 l> satisfy v{B^ l> ) < 



(n) 



and v{B { 2 n) ) > (1 



2< .-, 



'-T 



)T. Moreover, over any interval 



[r t , Ti?\] C B [ 2 n> , we must have, 

(M(")(r^)- g (»)(r/")))<-|a„(r/"^. 

otherwise we could construct a LFSP contradicting Lemma [3] 
by choosing a subsequence in n along which the above in- 
equality does not hold and converges to some r € [0, T]. 
Then we have the following bound on the total increment of 
(M (n) (')) over B { 2 n) , 



(n) 



E 



lCBi 



M w (l8)-9 (B) rf' ,) ) 

<*2 



< 



(1 



2e 5 



)T. 



2 £2 

Also, because we have assumed bounded arrivals, there is a 
finite ci > such that the total increment of (b, q^ n '(-)) over 
[0, T] \ B { 2 n) satisfies, 



(b,qW(T)- q M( Tl .) 



E 



„(»)((-); 



ICS 



< Ci — 1 . 

£2 

'i+iJt-"l 

Finally, summing the above two inequalities and taking limit, 
we get, 

»(0) 



(b,q(T)-q(0)) < 



q 



< 



limsup(M (n) (T) 
-T(|-^(^ + 2 Cl) ) 

^ £2 



Now we can fix a positive <5s < 82 /2 and choose 65 > small 



enough to satisfy (42 1, completing the proof. 



Remark 5: While the reader may conclude the throughput- 
optimality of the p-Log rule from Lemma [2] alone, neverthe- 
less, the throughput-optimality explicitly bears out as a corol- 
lary of the above theorem. Specifically, ( |42| shows that for any 
zero cost limiting trajectories (f,g,q)- i.e. (f( n >, , q^™* 1 ) 
converge u.o.c to (f,g,q) with probability 1- with initial 
condition (b, q(0)) = 1, we have (b,q(T)) = for all 
T > i. That is, for any deterministic initial state satisfying 
(&,g (n) ( )) < 1, we have (b,q^(T)) with probability 
1. Convergence with probability 1, along with the bound 
(b,q^(T) - q (n) {Q)) < C{b x + b 2 )(T + 1) (recall the 
assumption of bounded arrivals), implies convergence in the 
mean too, and therefore, 



lim sup 



sup 



E 



E^ n) ( T ) 



= 0. (43) 



By 1 24 1 (or see Theorem 4.1 of |4),) this proves the 
throughput-optimality of p-Log rule. 

Fix a time T > and constants C* > S > 0. Just as ( |4"3"j ) 
was obtained from ( [42) 1, the following too can be shown. For 
all large n, uniformly on (b, q {n) (0)) < C*, if (b, q (n) {-)) > S 
over [t, t + T], then, 



sup E 



b,qW(t + T))-(b,qW(t) 



<*5 

< — -T 
~ 2 



The above along with Dynkin's formula (see Theorem 19.1.2 
of | [25| ) implies the following result. 

Lemma 5: Let constants C* > S > be fixed. Consider 
the stopping time, 

(3^ = inf {f > : (b,q M {t)\ < 5} . 

Then, for all sufficiently large n, uniformly on the initial states 
with (b, g (,l) (0)) < C*, we have, 

E[ f3 (n) } < A 2 C* , 

for some finite A2 > 0. 

Proof of Theorem [2] Now we can proceed with the proof 
of Theorem [2] Consider the scaled (random) process q( n \ 
For any fixed constants C* > 1 > e* > e > S > 0, define the 
following stopping times, 

aW = inf{t>0: (b, q (n) > l} , 

/3 (n) = inf{t>0: ^,q (Tl) (<)) < (5} , 

77W = inf{t> y aW: (b,qW(t)} > e\ . 

Let p' n ) denote the stationary distribution of process q^ n \t), 
p x n ^ its distribution conditional on q(™)(0) = x, and E x 

(n) 

expectation under p x . Then, for any arbitrary T > 0, it is 



easy to show (see (8.15) 1 15 1) the following upper bound on 
the probability of overflow, 

SUP,:(^)<C^[/3 (n) ] 



EW n) > 1 



< 



sup 

x:{b,x)<e 



inf«(M>>e^[»?W] 



(44) 

Now, by uniform upper bound in Lemma [5] we 
have lims\ip n ^ 00 sup y . {hy) < G *E y [pW] < A 2 C*; 
and by bounded arrivals, we have the lower bound 
liminf„_ i . 00 inf z .^ 2 ^> e E z [r]^] > 0. Therefore, the 
terms that are going to decide the limit, 

lim sup - log p (n) ( ^2 > 1 ) 1 

are p x n) (^W > T) and pi n) < T). 

By Theorem [5] we can choose T large enough such that 
K(e*,S,T) > K{C*,8,T) > J„. Then by Theorem |5] 

lim sup - log sup p<W (/3 (,l) > T) 

lf(C*,*,T) < - J** . 



< 
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By ( |4T| i and the definition of J** in ( fl5| >, for any £6 > 0, we 
can choose an even larger a T (if required) and an e* > small 



enough such that J** <y ^ 



> J* 



e 6 . Then by Theorem [3] 



lim sup — log sup 

n->oo ?1 x:(b,x)< 



,(«) 



< T 



< 



J,. 



,<T, ( 



< 



Since we can choose eg arbitrarily small (and subsequently fix 
constants 1 > e* > e > 5 > 0), substituting the above two 



bounds in (44 1 completes the proof. 
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